Add properties in Evaluation Result - Custom Evaluator extra fields. #46077
Conversation
Pull request overview
Adds support for passing through custom evaluator “extra fields” via a properties bag into the AOAI-style evaluation result objects produced by the evaluation results converter.
Changes:
- Update `_extract_metric_values` to detect an `outputs.<criteria>.properties` dict and propagate it onto per-metric extracted values.
- Update `_create_result_object` to include `properties` in the final AOAI result payload when present.
- Add a unit test asserting `properties` is preserved and not flattened into the top-level result object.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluate/_evaluate.py | Propagates a per-criteria properties dict into per-metric result objects during AOAI conversion. |
| sdk/evaluation/azure-ai-evaluation/tests/unittests/test_evaluate.py | Adds coverage validating properties passthrough behavior for a custom evaluator result row. |
feat(evaluation): support properties passthrough in AOAI evaluation results
Pass through evaluator properties dict in AOAI evaluation results. When an evaluator returns a properties dict, it is included alongside score, label, reason, threshold, and passed in the result object. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Update _extract_metric_values and _create_result_object docstrings to document the new properties field and its expected dict type. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Address PR review: warn users when their custom evaluator returns 'properties' as a non-dict type so they can fix the output format. Also add properties to _create_result_object example input. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
…thub.com/Azure/azure-sdk-for-python into waqasjaved02/aoai-properties-passthrough
Remove erroneous space in self._eval_metric. value (two occurrences) that would cause an AttributeError at runtime when building result keys for _details and _total_tokens fields. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
nagkumar91
left a comment
Review — Properties passthrough
- Shared dict reference (bug-risk) — In `_extract_metric_values`, the same `properties` object is assigned to every metric entry:

  ```python
  for metric_dict in result_per_metric.values():
      metric_dict["properties"] = properties  # same object reference
  ```

  If anything downstream mutates one entry's properties, all entries are affected. Consider `metric_dict["properties"] = properties.copy()` (or `copy.deepcopy` if nested dicts matter).
- No test for the warning path — The `isinstance(metric_value, dict)` guard logs a warning when `properties` isn't a dict, but no test covers this branch. A quick test passing `properties="not_a_dict"` would confirm the warning fires and `properties` is omitted.
- Typo fixes in `_base_rai_svc_eval.py` — Good catch on `self._eval_metric. value` → `self._eval_metric.value`. Cosmetic (Python allows whitespace after the dot) but worth cleaning up.
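The shared-reference concern raised in the review can be demonstrated with a standalone snippet (variable names here are illustrative, not the SDK's):

```python
import copy

properties = {"detail": "original", "nested": {"count": 1}}
result_per_metric = {"accuracy": {}, "fluency": {}}

# Shared reference: every metric entry points at the very same dict.
for metric_dict in result_per_metric.values():
    metric_dict["properties"] = properties

result_per_metric["accuracy"]["properties"]["detail"] = "mutated"
# The mutation leaks into every other entry:
assert result_per_metric["fluency"]["properties"]["detail"] == "mutated"

# A shallow copy per entry isolates top-level keys...
isolated = {name: {"properties": properties.copy()} for name in result_per_metric}
isolated["accuracy"]["properties"]["detail"] = "a-only"
assert isolated["fluency"]["properties"]["detail"] == "mutated"  # unaffected

# ...but nested dicts are still shared; copy.deepcopy breaks that link too.
deep = {name: {"properties": copy.deepcopy(properties)} for name in result_per_metric}
deep["accuracy"]["properties"]["nested"]["count"] = 99
assert deep["fluency"]["properties"]["nested"]["count"] == 1
```

This is why the suggestion distinguishes `properties.copy()` (enough when the bag holds only scalars) from `copy.deepcopy` (needed if the bag nests mutable values).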
Address PR feedback: copy properties dict and add non-dict test
- Use properties.copy() to avoid shared dict reference across metrics
- Add test for non-dict properties logging and omission
- Change properties type mismatch log level from warning to info
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
nagkumar91
left a comment
All review items addressed — properties.copy(), non-dict test coverage, and log level adjustment. LGTM.
Add properties in Evaluation Result - Custom Evaluator extra fields. (#46077)

* feat(evaluation): support properties passthrough in AOAI evaluation results. Pass through evaluator properties dict in AOAI evaluation results. When an evaluator returns a properties dict, it is included alongside score, label, reason, threshold, and passed in the result object.
* docs: update docstrings for properties passthrough per PR review. Update _extract_metric_values and _create_result_object docstrings to document the new properties field and its expected dict type.
* fix: log warning when properties is not a dict. Address PR review: warn users when their custom evaluator returns 'properties' as a non-dict type so they can fix the output format. Also add properties to _create_result_object example input.
* Release-1-16-4
* Fix stray space in _eval_metric.value attribute access. Remove erroneous space in self._eval_metric. value (two occurrences) that would cause an AttributeError at runtime when building result keys for _details and _total_tokens fields.
* Remove empty changelog sections to fix Build Analyze check
* Address PR feedback: copy properties dict and add non-dict test. Use properties.copy() to avoid shared dict reference across metrics; add test for non-dict properties logging and omission; change properties type mismatch log level from warning to info.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Sydney Lister <sydneylister@microsoft.com>
Evaluation Result is OpenAI compliant. It contains score, label, reason, etc.
We need to surface more evaluation results in the UI, so we are introducing this property bag, in which we can add more details. The science team is also going to use this property bag to expose additional outputs per evaluator.
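As an illustration, a custom evaluator that populates such a property bag might look like the following (the class name, scoring heuristic, and property keys are all hypothetical, not part of the SDK):

```python
class CustomCoherenceEvaluator:
    """Hypothetical custom evaluator: returns the standard fields
    (score, label, reason, threshold, passed) plus a 'properties'
    bag carrying extra per-evaluation outputs for the UI."""

    THRESHOLD = 0.5

    def __call__(self, *, response: str, **kwargs) -> dict:
        word_count = len(response.split())
        score = min(word_count / 10.0, 1.0)  # toy length-based heuristic
        return {
            "score": score,
            "label": "pass" if score >= self.THRESHOLD else "fail",
            "reason": "Toy length-based coherence heuristic.",
            "threshold": self.THRESHOLD,
            "passed": score >= self.THRESHOLD,
            # Extra details surfaced via the property bag:
            "properties": {
                "word_count": word_count,
            },
        }
```

With the change in this PR, the converter carries `properties` through into the AOAI-style result object alongside the standard fields rather than flattening its keys into the top level.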